Abstract

Let $Z(F)$ be the number of solutions of a random $k$-satisfiability formula $F$ with $n$ variables
and clause density $\alpha$. Assume that the probability that $F$ is unsatisfiable is $O(1/\log(n)^{1+\varepsilon})$ for
some $\varepsilon > 0$. We show that (possibly excluding a countable set of 'exceptional' $\alpha$'s) there exists a
non-random function $\phi(\alpha)$ such that, for any $\delta > 0$, $(1/n)\log Z(F) \in [\phi-\delta, \phi+\delta]$ with high probability.
In particular, the assumption holds for all $\alpha < 1$, which proves the above concentration claim in
the whole satisfiability regime of random 2-SAT. We also extend these results to a broad class of
constraint satisfaction problems.

The proof is based on an interpolation technique from spin-glass theory, and on an application 
of Friedgut's theorem on sharp thresholds for graph properties. 

1 Introduction and main results

Over the last twenty years, a considerable effort has been devoted to understanding the typical
properties of random $k$-satisfiability ($k$-SAT) instances. This line of work was initially motivated by
the empirical discovery of a striking relation between the running time of standard solvers and the
proximity to the 'satisfiability threshold'. Since then, an important motivation for these investigations
has been to develop better heuristics to cope with hard constraint satisfaction problems.

Significant progress has been made along this path. In particular, it was established early on
that, indeed, the probability that a random instance is unsatisfiable increases sharply from 0 to
1 when the 'clause density' (number of clauses per variable) crosses a critical threshold [Fri99].
This phenomenon is referred to as the satisfiability phase transition or satisfiability threshold. The
critical density might a priori depend on the number of variables, although upper and lower bounds
are known to match up to a multiplicative constant that goes to 1 when $k$ increases [ANP05].

On the other hand, a significantly more detailed picture has been conjectured, building on
non-rigorous techniques from statistical physics such as the replica and cavity methods [MZK+99, MPZ03,
KMRT+07, MM09]. In particular, not only is an $n$-independent critical density conjectured to exist,
but explicit values (depending on $k$) were computed in [MPZ03]. This type of analysis also led
to an intriguing picture of the geometry of the set of solutions of a random satisfiability instance
[MPZ03, KMRT+07, MM09]. While only a small subset of these results have been established
rigorously, they provided guidance and stimulus for exciting rigorous developments [ACO08].
In this paper we explore the most basic property of the set of solutions of a random $k$-SAT
instance: its cardinality. The problem of computing the number of solutions of a $k$-satisfiability
($k$-SAT) formula is well known to be #P-complete for any $k \geq 2$. Even worse, there is no fully
polynomial randomized approximation scheme (FPRAS) to approximate the number of solutions
unless NP = RP [DGGJ03].

Here we are instead interested in asymptotic estimates of the number of solutions of a random
formula, in the following sense. Let $F(n,\alpha)$ denote a formula on $n$ variables with clause density
$\alpha$ (each clause having $k$ literals) and let $Z(F(n,\alpha))$ be the number of solutions of $F(n,\alpha)$. From a
statistical physics viewpoint, one of the most basic conjectures is that $Z(F(n,\alpha))$ concentrates on the
exponential scale. Namely, for each $\alpha < \alpha_s(k)$ (the satisfiability threshold), there exists $\phi = \phi(\alpha)$
non-random such that, for any $\delta > 0$, $2^{n(\phi-\delta)} \leq Z(F(n,\alpha)) \leq 2^{n(\phi+\delta)}$ with high probability. In
formula

$$\lim_{n\to\infty} \mathbb{P}\{2^{n(\phi-\delta)} \leq Z(F(n,\alpha)) \leq 2^{n(\phi+\delta)}\} = 1 . \qquad (1.1)$$

While for arbitrary $k \geq 3$ we cannot establish this conjecture for every $\alpha$ below the satisfiability threshold
(which would imply in particular the existence of an $n$-independent satisfiability threshold), we are
able to prove that (1.1) holds for any $\alpha$ such that $\mathbb{P}\{Z(F(n,\alpha)) = 0\}$ (the unsatisfiability probability)
is upper bounded by $1/(\log n)^{1+\varepsilon}$ for some $\varepsilon > 0$ and all $n$ large enough. In particular, this establishes
the conjecture for $k = 2$ (in the entire satisfiable phase), and for $k \geq 3$ and $\alpha < 1$. A verification of
the previous assumption for arbitrary values of $k$ and $\alpha$ up to the so-called 'dynamic threshold' is also
discussed.

We further generalize these results to a broad family of constraint satisfaction problems. 

One basic difficulty in establishing (1.1) is that the concentration of $\log Z(F)$ cannot be proved
using standard martingale methods. Such an argument typically requires controlling the difference
$|\log Z(F') - \log Z(F)|$ for $F$ and $F'$ differing in a single clause [Led01]. Unfortunately, adding a
single clause can change the value of $Z(F)$ from exponentially large to $Z(F') = 0$.

In order to overcome this difficulty, we use Friedgut's theorem to prove that the property $Z(F) < 2^{n\phi}$
has a sharp threshold. We then translate this result into a concentration statement by proving
that $(1/n)\,\mathbb{E}\log(1 + Z(F))$ converges, using an interpolation technique. This is where the condition
on the decay of the unsatisfiability probability is used, since the interpolation technique does not
lead to a formal superadditivity property in this setting.

The interpolation method we use was first developed by Francesco Guerra and Fabio Toninelli in
the context of spin glass theory [GT02] and then generalized to a number of problems from statistical
physics, computer science [FL03, FLT03, PT04] and coding theory [Mon05, KM09]. A nice suite of
combinatorics applications was recently presented in [BGT10].

In particular, the interpolation technique was applied to random $k$-SAT in [FL03, FLT03] to show
that a suitable normalized log-partition function has a limit for $n \to \infty$. The same work implies
that the minimum number of unsatisfied clauses per variable has a limit as well. Let us emphasize
a crucial difference between such results and the conjecture (1.1): the quantities (normalized
log-partition function or optimal cost) considered in [FL03, FLT03] and follow-up work are easily
proved to concentrate via martingale methods. This considerably simplifies those proofs.

Acknowledgements. The present research was initiated as far back as 2006, and remained
dormant for a long period of time. Some results were presented at the 'DIMACS Working Group on
Message-Passing Algorithms' in October 2008. We were finally motivated to polish and publish the
manuscript after discussions with Mohsen Bayati, David Gamarnik and Prasad Tetali regarding their
a careful reading of some parts of the manuscript and for useful suggestions.

This work was partially supported by a Terman fellowship, the NSF CAREER award CCF-

2 Random $k$-SAT

Definition 1. A $k$-clause is a disjunction of $k$ Boolean variables or their negations. Let $C_k(n)$
be the set of all $N = \binom{n}{k}2^k$ possible $k$-clauses on $n$ Boolean variables. We denote by $F_k(n,\alpha)$ a
random formula which is formed by selecting independently each element in $C_k(n)$ with probability
$p_k(n,\alpha) = \alpha n/\big(\binom{n}{k}2^k\big)$, and by taking the conjunction of the selected clauses.

The number of clauses in the above model is a binomial random variable, which concentrates
exponentially fast around its expectation $\alpha n$. Some of our computations prove to be simpler within
slightly different models, whereby the number of clauses is either Poisson or deterministic with the
same mean $\alpha n$. Standard monotonicity arguments can be used to show the equivalence of these
models for our purposes and we will hence switch freely between these different models.

Unless specified, the value of k will remain fixed throughout the paper and the k subscript is 
dropped. 

We denote by $Z(F)$ the number of satisfying assignments (solutions) of a Boolean formula $F$ and
by

$$P_n(\alpha,\phi) := \mathbb{P}\{Z(F(n,\alpha)) < 2^{n\phi}\},$$

the probability that a random formula has fewer than $2^{n\phi}$ satisfying assignments.
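
As a minimal sketch of the objects just defined, the following Python snippet samples a formula in the fixed-clause-count variant of the model (exactly round(alpha*n) clauses, one of the equivalent models mentioned above), counts $Z(F)$ by brute force, and estimates $P_n(\alpha,\phi)$ by Monte Carlo. The function names, the literal encoding and the seed are illustrative assumptions rather than anything taken from the paper, and brute-force counting is only feasible for small $n$.

```python
import itertools
import random

def sample_formula(n, k, alpha, rng):
    """Draw round(alpha*n) k-clauses uniformly at random (fixed-clause-count model)."""
    m = round(alpha * n)
    formula = []
    for _ in range(m):
        variables = rng.sample(range(n), k)            # k distinct variable indices
        values = [rng.choice((True, False)) for _ in range(k)]
        # A literal (i, v) is satisfied by an assignment x when x[i] == v.
        formula.append(tuple(zip(variables, values)))
    return formula

def count_solutions(formula, n):
    """Z(F): number of assignments in {False, True}^n satisfying every clause."""
    total = 0
    for x in itertools.product((False, True), repeat=n):
        if all(any(x[i] == v for i, v in clause) for clause in formula):
            total += 1
    return total

def estimate_P(n, k, alpha, phi, trials, seed=0):
    """Monte Carlo estimate of P_n(alpha, phi) = P{Z(F) < 2^(n*phi)}."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        z = count_solutions(sample_formula(n, k, alpha, rng), n)
        if z < 2 ** (n * phi):
            hits += 1
    return hits / trials
```

With $\phi = 0$ the estimate is the empirical fraction of unsatisfiable formulas, which is the quantity entering the definition of $\alpha^*$ below.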

Definition 2. Note that $P_n(\alpha,0) = \mathbb{P}\{Z(F(n,\alpha)) = 0\}$ is the probability that $F(n,\alpha)$ is unsatisfiable
(UNSAT). We define

$$\alpha^* := \sup\{\alpha : P_n(\alpha,0) = O(1/(\log n)^{1+\varepsilon}) \text{ for some } \varepsilon > 0\} .$$

Remark 1. The results in this paper still hold when defining $\alpha^*$ to be $\sup\{\alpha : \sum_n P_n(\alpha,0)/n < \infty\}$.
However, the above definition simplifies the proofs without much loss of generality.

Remark 2. For any $k \geq 2$, we have $\alpha^* \geq 1$. Indeed, considering the case of 2-SAT, [dlV92]
proves that, for $\alpha < 1$, $\mathbb{P}\{Z(F_2(n,\alpha)) = 0\} = O(1/n)$. Since for any $k \geq 3$, $\mathbb{P}\{Z(F_k(n,\alpha)) = 0\} \leq
\mathbb{P}\{Z(F_2(n,\alpha)) = 0\}$, we conclude

$$\alpha^* \geq 1$$

for $k \geq 3$ as well.

Unfortunately, the bounds on the satisfiability threshold based on the second moment method
[ANP05] do not imply any quantitative estimate on the probability that $F_k(n,\alpha)$ is UNSAT. It
might be possible to prove such an estimate by a careful analysis of specific solution algorithms.
In particular, a careful analysis of the recent algorithm [CO09] might lead to a proof of $\alpha^* \geq
2^k(1-\delta)\log k/k$, for $k$ large enough [CO10]. We expect that $\alpha^*$ does coincide with the satisfiability
threshold.

Our main result establishes the conjecture (1.1) for $\alpha < \alpha^*$, except possibly for countably many
'special' values of $\alpha$.
Theorem 1. There exist a countable set $\mathcal{C} \subseteq [0,\alpha^*)$ and $\phi_s : [0,\alpha^*) \to [0,1]$ such that the following
holds. For any $\varepsilon > 0$ and any $\alpha \in [0,\alpha^*) \setminus \mathcal{C}$, we have

$$\lim_{n\to\infty} P_n(\alpha, \phi_s(\alpha) - \varepsilon) = 0,$$

$$\lim_{n\to\infty} P_n(\alpha, \phi_s(\alpha) + \varepsilon) = 1.$$

In order to prove this result, which is done in Section 4.4, we first show the following sharp
threshold result.

Theorem 2. For any $\varepsilon > 0$ and $\phi \in [0,1)$, there exists $\{\alpha_n(\phi)\}_{n\in\mathbb{Z}_+}$ such that

$$\lim_{n\to\infty} P_n(\alpha_n(\phi) - \varepsilon, \phi) = 0,$$

$$\lim_{n\to\infty} P_n(\alpha_n(\phi) + \varepsilon, \phi) = 1.$$

In other words, for any fixed $\phi$, the property $\{Z(F(n,\alpha)) < 2^{n\phi}\}$ has a sharp threshold in $\alpha$. In
[Fri99], this result is shown for $\phi = 0$, i.e., for the property $\{F(n,\alpha)$ is UNSAT$\}$. As for the $\phi = 0$
case, Theorem 2 is proved by showing that the monotone property $\{Z(F(n,\alpha)) < 2^{n\phi}\}$ cannot be
approximated by a "local property", and hence must have a sharp threshold. For the convenience
of the reader, and in order to introduce relevant notation, we reproduce in Section 4.1 Friedgut's
theorem on sharp thresholds for monotone properties [Fri99]. The proof of Theorem 2 is deferred to
Section 4.2.

In order to prove Theorem 1, we then would like to transfer the threshold in $\alpha$ (Theorem 2)
into a threshold in $\phi$. This step is however not straightforward because of the little knowledge we
have about the function $\phi \mapsto \alpha_n(\phi)$. In order to establish this threshold transfer, we first prove the
following result, which shows the existence of the $n \to \infty$ limit of the normalized logarithm of the
number of solutions when $\alpha < \alpha^*$.

Theorem 3. Let

$$\psi_n(\alpha) := \frac{1}{n}\,\mathbb{E}[\log Z(F(n,\alpha)) \mid Z(F(n,\alpha)) \geq 1].$$

We have that $\psi_n(\alpha)$ converges to a limit $\phi_s(\alpha)$, for every $\alpha < \alpha^*$.

This theorem is proved in Section 4.3, and the key step in the proof consists in establishing the
following pseudo-superadditivity property.

Lemma 1. For any $\alpha$, let $Z_n := Z(F(n,\alpha))$. We then have, for any $n_1, n_2 \geq k$,

$$\mathbb{E}\log(1 + Z_{n_1+n_2}) \geq \mathbb{E}\log(1 + Z_{n_1}Z_{n_2}) .$$

The proof of Lemma 1 is deferred to Section 4.3 and is based on the interpolation technique of
Guerra and Toninelli [GT02] and Franz-Leone [FL03]. However, while in those cases one obtains
superadditivity of $\mathbb{E}\log Z$, in the present case we get a weaker result because of the $1+$ term.
Hence, in this case the existence of the infinite volume limit is not a straightforward consequence
of Lemma 1. Notice that this problem is intimately related to the fact that $Z = 0$ with positive
probability, and therefore $\mathbb{E}\log Z$ is not defined. We use the fact that $\alpha < \alpha^*$ to circumvent this
problem in Theorem 3.

Finally, Theorem 3 implies the following result, which allows us to complete the threshold transfer.

Lemma 2. For each $n$ and $\phi$, let $\alpha_n(\phi)$ be such that $P_n(\alpha_n(\phi),\phi) = 1/2$. We have that $\alpha_n(\phi)$
converges for almost every $\phi \in \phi_s([0,\alpha^*))$ (where $\phi_s$ is defined in Theorem 3).

Lemma 2 is proved in Section 4.4.
3 A general family of random CSP's

In this section, we extend the results of the previous section to a general family of random constraint
satisfaction problems (CSP) over binary variables. An ensemble in this family is defined as follows.

Definition 3. Let $\mu$ be a distribution over Boolean functions $\varphi : \{-1,+1\}^k \to \{0,1\}$, which we
call the clause type distribution. Let $n$ be an integer and $\alpha \in \mathbb{R}_+$. A random formula from the
ensemble $F_k(\alpha,n,\mu)$ is drawn as follows. For each $a \in \{1,\dots,m = \lfloor\alpha n\rfloor\}$ the $a$-th clause is drawn
independently from previous ones. For clause $a$, $k$ indices $i_1(a),\dots,i_k(a)$ are drawn independently
and uniformly at random in $[n]$. Further, $\varphi_a : \{-1,+1\}^k \to \{0,1\}$ is drawn under the distribution $\mu$,
producing the clause $\varphi_a(x_{i_1(a)},\dots,x_{i_k(a)})$.

An assignment $x \in \{+1,-1\}^n$ is said to satisfy the formula $F_k(\alpha,n,\mu)$ if, for each $a \in [m]$, we
have $\varphi_a(x_{i_1(a)},\dots,x_{i_k(a)}) = 1$.

As in the previous section, we will often drop the subscript $k$ in the following. Further, $Z(F)$ is the
number of satisfying assignments of formula $F$ and we define

$$P_n(\alpha,\phi,\mu) := \mathbb{P}\{Z(F(\alpha,n,\mu)) < 2^{n\phi}\} .$$

Definition 4. Note that $P_n(\alpha,0,\mu) = \mathbb{P}\{Z(F(\alpha,n,\mu)) = 0\}$ is the probability that $F(\alpha,n,\mu)$ is
UNSAT. We define

$$\alpha^*(\mu) := \sup\{\alpha : P_n(\alpha,0,\mu) = O(1/(\log n)^{1+\varepsilon}) \text{ for some } \varepsilon > 0\} .$$
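
Definition 5. For $\varphi : \{-1,1\}^k \to \{0,1\}$ and $\theta \in [-1,1]$, let

$$\|\varphi\|_\theta = \sum_x \varphi(x)\,\nu_\theta(x) \quad\text{and}\quad \|\varphi\| = \|\varphi\|_0 ,$$

where

$$\nu_\theta(x) = \prod_{i=1}^{k} \frac{1+\theta x_i}{2} .$$

Note that $\|\varphi\|_\theta$ is the probability that $\varphi = 1$ under the measure $\nu_\theta$, which assigns probabilities
$(1-\theta)/2$ and $(1+\theta)/2$ to $-1$ and $+1$ respectively.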

Our CSP ensemble is specified by the distribution $\mu$ over clause types and we now describe two
sets of hypotheses on this distribution.

H1. (a) Dominance of balanced assignments. For every $\theta \in [-1,1]$, $\mathbb{E}_\varphi\log\|\varphi\|_\theta \leq \mathbb{E}_\varphi\log\|\varphi\|$, with
equality only if $\theta = 0$. This condition implies that, in a typical random instance, most solutions have
almost as many $+1$'s as $-1$'s.

(b) Unsatisfiability of uniform assignments. For every $s \in \{-1,+1\}$, there is at least one clause $\varphi$
with $\mu(\varphi) > 0$ such that $\varphi(s,\dots,s) = 0$.

H2. Convexity of $\Gamma_l$. Let $\mathcal{M}_1(\{-1,1\}^l)$ be the set of probability measures on $\{-1,1\}^l$ and let

$$\Gamma_l : \mathcal{M}_1(\{-1,1\}^l) \to \mathbb{R}$$

be defined by

$$\Gamma_l(\nu) := \mathbb{E}_\varphi\,\mathbb{E}_Z \prod_{r=1}^{l}\big(1-\varphi(Z^{(r)})\big) \qquad (3.1)$$

where $Z^{(r)}$ are Boolean random vectors of dimension $k$ such that $Z_i = (Z_i^{(1)},\dots,Z_i^{(l)})$, $i = 1,\dots,k$,
are i.i.d. with distribution $\nu$, whereas $\varphi$ is a random clause type drawn under distribution $\mu$. Under
H2, $\mu$ is assumed to make $\Gamma_l$ convex for any $l \geq 1$.

Notice that conditions H1.(a) and H1.(b) coincide with conditions 4 and 5 in [MRT09]. Further,
hypothesis H1 is satisfied by a number of interesting random CSP ensembles. Such examples include

• $k$-NAE-SAT, where $\varphi(x) = \varphi_s(x) = \mathbb{1}(x \notin \{-s,s\})$ and $\mu(\varphi_s) = 2^{-k}$ for each $s \in \{-1,1\}^k$;

• Hypergraph 2-coloring, where $\varphi(x) = \mathbb{1}(x \notin \{-\mathbf{1},+\mathbf{1}\})$ is the unique clause in the support of
$\mu$, with $-\mathbf{1} = (-1,\dots,-1)$ and $+\mathbf{1} = (+1,\dots,+1)$;

• $k$-XOR-SAT, where $\varphi(x) = \varphi_s(x) = \mathbb{1}(\prod_{i=1}^{k} x_i = s)$ and $\mu(\varphi_s) = 1/2$ for each $s \in \{-1,1\}$;

• $k$-SAT, where $\varphi(x) = \varphi_s(x) = \mathbb{1}(x \neq s)$ and $\mu(\varphi_s) = 2^{-k}$ for each $s \in \{-1,1\}^k$.

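For the first three examples above, it is checked in [MRT09] that hypothesis H1 is satisfied. Let us
check that this is the case for $k$-SAT as well. Note that

$$\mathbb{E}_s\|\varphi\|_\theta = 1 - \mathbb{E}_s\prod_{i=1}^{k}\frac{1+\theta s_i}{2} = 1 - 2^{-k} = \mathbb{E}_s\|\varphi\| ,$$

hence

$$\mathbb{E}_s\log\|\varphi\|_\theta \leq \log\mathbb{E}_s\|\varphi\|_\theta = \log\mathbb{E}_s\|\varphi\| = \mathbb{E}_s\log\|\varphi\| .$$

This verifies condition H1.(a). Condition H1.(b) holds trivially.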
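
The computation above can be checked numerically for small $k$. The sketch below is only an illustration under our own naming (the helpers `norm_theta` and `check_h1a` do not come from the paper): it evaluates $\|\varphi_s\|_\theta = 1 - \nu_\theta(s)$ for every sign vector $s$, then averages over uniform $s$. It assumes $|\theta| < 1$ so that all logarithms are finite.

```python
import itertools
import math

def norm_theta(s, theta):
    """||phi_s||_theta for the k-SAT clause phi_s(x) = 1(x != s)."""
    # nu_theta(s) = prod_i (1 + theta*s_i)/2 is the weight of the single excluded point s.
    return 1.0 - math.prod((1.0 + theta * si) / 2.0 for si in s)

def check_h1a(k, theta):
    """Return (E_s ||phi_s||_theta, E_s log ||phi_s||_theta, log ||phi||_0) for uniform s.

    Assumes -1 < theta < 1, so that every norm is strictly positive.
    """
    points = list(itertools.product((-1, 1), repeat=k))
    norms = [norm_theta(s, theta) for s in points]
    mean_norm = sum(norms) / len(norms)
    mean_log = sum(math.log(v) for v in norms) / len(norms)
    return mean_norm, mean_log, math.log(1.0 - 2.0 ** (-k))
```

For instance, `check_h1a(3, 0.4)` returns a first component equal to $1 - 2^{-3}$ up to rounding, and a second component strictly below the third, in line with the Jensen step.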

Hypothesis H2 is not straightforward to check. The next definition characterizes a family of 
clause type distributions satisfying it. 

Definition 6 ($k$-factorizing distributions). A clause type distribution $\mu$ is said to $k$-factorize if it
has the following structure. There exists an integer $J \geq 1$, such that any $\varphi \in \mathrm{supp}(\mu)$ is of the form

$$\varphi(x) = \mathbb{1}\big(x \notin \{s^{(1)},\dots,s^{(J)}\}\big), \qquad (3.2)$$

for some $s^{(1)},\dots,s^{(J)} \in \{-1,1\}^k$, and

$$\mu(\varphi) = \prod_{i=1}^{k}\hat\mu\big(s_i^{(1)},\dots,s_i^{(J)}\big) \qquad (3.3)$$

where $\hat\mu$ is a probability distribution on $\{-1,1\}^J$. In other words, the vectors $(s_i^{(1)},\dots,s_i^{(J)})$, for
$i = 1,\dots,k$, can have correlated components but are mutually i.i.d. with distribution $\hat\mu$.

This definition can be generalized by letting $J$ itself be random, but we stick to the above case
for the sake of simplicity.

The class of $k$-factorizing clause type distributions includes, among other problems:

• $k$-NAE-SAT: $\varphi(x) = \mathbb{1}(x \notin \{s,-s\})$ for $s \in \{-1,+1\}^k$ uniformly random. This is $k$-factorizing
with $\hat\mu(-1,1) = \hat\mu(1,-1) = 1/2$;

• Hypergraph 2-coloring: $\varphi(x) = \mathbb{1}(x \notin \{-\mathbf{1},+\mathbf{1}\})$ with $\hat\mu(-1,1) = 1$;

• $k$-SAT: $\varphi(x) = \mathbb{1}(x \notin \{s\})$ with $\hat\mu(1) = \hat\mu(-1) = 1/2$.

Condition H2 is satisfied by $k$-factorizing distributions, as stated formally below.

Lemma 3. The mapping $\Gamma_l$ is convex for any $l \geq 1$ if the clause type distribution $k$-factorizes.

Note that $k$-XOR-SAT does not belong to this class of distributions; nevertheless, condition H2
holds in this case as well, as stated below.

Lemma 4. The mapping $\Gamma_l$ is convex for any $l \geq 1$ for $k$-XOR-SAT with $k$ even.

The proofs of Lemma 3 and Lemma 4 are deferred to Section 4.5. We now state the equivalent of
Theorem 1 for this general class of CSPs.

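Theorem 4. Assume that $\mu$ satisfies conditions H1 and H2. Then there exist a countable set $\mathcal{C}$ and a
function $\alpha \mapsto \phi_s(\alpha)$ such that, for any $\alpha \in [0,\alpha^*(\mu)) \setminus \mathcal{C}$, and any $\varepsilon > 0$,

$$\lim_{n\to\infty} P_n(\alpha, \phi_s(\alpha) - \varepsilon, \mu) = 0,$$

$$\lim_{n\to\infty} P_n(\alpha, \phi_s(\alpha) + \varepsilon, \mu) = 1.$$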

As in the previous section, the proof of this theorem is based on the following two theorems.

Theorem 5. For any $\mu$ satisfying H1 and $\phi \in [0,1)$, there exists $\{\alpha_n(\phi)\}_{n\in\mathbb{Z}_+}$ such that for any
$\varepsilon > 0$,

$$\lim_{n\to\infty} P_n(\alpha_n(\phi) - \varepsilon, \phi, \mu) = 0,$$

$$\lim_{n\to\infty} P_n(\alpha_n(\phi) + \varepsilon, \phi, \mu) = 1.$$

This theorem is proved in [MRT09]. Although in that paper a larger set of conditions was required
in the formal statement, by simple inspection of the proof it follows that only conditions H1.(a) and
H1.(b) were in fact used.

Theorem 6. Let

$$\psi_n(\alpha) := \frac{1}{n}\,\mathbb{E}[\log Z(F(\alpha,n,\mu)) \mid Z(F(\alpha,n,\mu)) \geq 1].$$

For any $\mu$ satisfying H2 and for any $\alpha < \alpha^*(\mu)$, $\psi_n(\alpha)$ converges to a limit $\phi_s(\alpha)$.

The proof of this theorem is based on the following pseudo-superadditivity lemma.

Lemma 5. For any $\alpha$ and $\mu$ satisfying H2, let $Z_n := Z(F(\alpha,n,\mu))$. We then have, for any $n_1, n_2 \geq k$,

$$\mathbb{E}\log(1 + Z_{n_1+n_2}) \geq \mathbb{E}\log(1 + Z_{n_1}Z_{n_2}) .$$

The proof of Lemma 5 is deferred to Section 4.5. The proofs of Theorem 6 and Theorem 4 follow
the same analytical arguments as for the $k$-SAT case and hence we omit the details here and refer
to the proofs of Theorem 3 and Theorem 1.
4 Proofs

4.1 Friedgut's theorem: A reminder

Let $\mathbb{P}_p$ be the product measure on $\{0,1\}^N$, such that for $(z_1,\dots,z_N) \in \{0,1\}^N$,
$\mathbb{P}_p(z_1,\dots,z_N) = p^{\sum_{i=1}^{N} z_i}(1-p)^{N-\sum_{i=1}^{N} z_i}$. If $N = \binom{n}{k}2^k$, the space $\{0,1\}^N$ is isomorphic to the space of $k$-CNF formulas
on $n$ variables and the measure $\mathbb{P}_p$ is the measure used in this paper for random $k$-SAT. We say that
$A \subseteq \{0,1\}^N$ is monotone if, for any $x \in A$ and $y \in \{0,1\}^N$, $x \leq y$ (component-wise) implies $y \in A$. Such
a collection of vectors $A$ is also called a monotone property; note that monotone here really
means monotonically increasing. When $N = \binom{n}{k}2^k$, a property is called symmetric, or equivalently
a formula property, if it is invariant under $k$-CNF formula automorphisms (invariant under
the action of the wreath product of the symmetric group $S_n$ with $k$ copies of $\mathbb{Z}_2$). For instance, the
UNSAT property of a $k$-CNF formula is monotone and symmetric.

Note that for $N$ fixed and a monotone symmetric property $A = A_N$, the function $p \mapsto \mathbb{P}_p(A)$
is a monotonically increasing polynomial with $\mathbb{P}_0(A) = 0$ and $\mathbb{P}_1(A) = 1$ (strictly increasing if $A \neq
\{0,1\}^N$). We can therefore define, for $\varepsilon \in [0,1]$, $p_\varepsilon$ such that $\mathbb{P}_{p_\varepsilon}(A) = \varepsilon$, and we call $\delta(\varepsilon) = p_{1-\varepsilon} - p_\varepsilon$
the critical interval. We then say that the property $A$ (or the sequence of properties $A_N$) has a sharp
threshold if for all $\varepsilon \in (0,1)$, $\delta(\varepsilon)/p_{1/2}$ tends to zero when $N$ increases. Note that for a monotone
property $A$ having a sharp threshold, there exists $\hat p \in (p_\varepsilon, p_{1-\varepsilon})$ such that $\mathbb{P}_p(A) \to 0$ if $p < \hat p(1-\varepsilon)$
and $\mathbb{P}_p(A) \to 1$ if $p > \hat p(1+\varepsilon)$.

If instead, for $\varepsilon \in (0,1)$, $\delta(\varepsilon)/p_{1/2}$ is bounded away from zero, we say that $A$ has a coarse
threshold. In typical examples, $\delta(\varepsilon)/p_{1/2}$ is bounded and hence admits subsequential limits. In order
to prove a sharp threshold for monotone properties, it is therefore sufficient to rule out the case of a
coarse threshold (modulo reducing to subsequences). If $A$ has a coarse threshold, then there exists
$p^* \in (p_\varepsilon, p_{1-\varepsilon})$ such that $p^* \cdot \frac{d}{dp}\mathbb{P}_p(A)\big|_{p=p^*} \leq 1/C$ uniformly in $N$.
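
To make the quantity $\delta(\varepsilon)/p_{1/2}$ concrete, here is a small numerical sketch on two toy monotone properties of $\{0,1\}^N$: "at least one coordinate equals 1" and "at least half of the coordinates equal 1". These examples, the bisection solver and all function names are our own illustrative choices and do not appear in the paper. The first property has a relative width that stays of order one as $N$ grows (coarse threshold), while the second has a relative width that shrinks with $N$ (sharp threshold).

```python
import math

def p_at_level(prob, level, lo=0.0, hi=1.0, iters=100):
    """Solve prob(p) = level by bisection, assuming prob is increasing in p."""
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if prob(mid) < level:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

def relative_width(prob, eps):
    """delta(eps) / p_{1/2} = (p_{1-eps} - p_eps) / p_{1/2}."""
    return (p_at_level(prob, 1 - eps) - p_at_level(prob, eps)) / p_at_level(prob, 0.5)

def at_least_one(N):
    """P_p(some coordinate equals 1) = 1 - (1 - p)^N."""
    return lambda p: 1.0 if p >= 1.0 else -math.expm1(N * math.log1p(-p))

def majority(N):
    """P_p(at least N/2 coordinates equal 1), with binomial terms computed in log-space."""
    def prob(p):
        if p <= 0.0:
            return 0.0
        if p >= 1.0:
            return 1.0
        total = 0.0
        for m in range((N + 1) // 2, N + 1):
            log_term = (math.lgamma(N + 1) - math.lgamma(m + 1) - math.lgamma(N - m + 1)
                        + m * math.log(p) + (N - m) * math.log1p(-p))
            total += math.exp(log_term)
        return min(1.0, total)
    return prob
```

For example, `relative_width(at_least_one(10**6), 0.1)` stays close to $(\ln 10 - \ln(10/9))/\ln 2 \approx 3.17$, whereas `relative_width(majority(N), 0.1)` decreases as `N` grows.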

For a given formula F, we denote by \F\ the number of clauses in F. The average degree of 
a formula F is the ratio between the number of variables and the number of clauses in F, and a 
balanced formula is a formula whose average degree is no less than that of any sub-formula. We also 
denote by E{F) the expected number of sub-formulas isomorphic to F in a random formula. We 
now state the result of Friedgut. 

Theorem 7 ([Fri99]). Let $0 < \beta < 1$. There exist functions $B = B(\varepsilon,c)$, $b_1 = b_1(\varepsilon,c)$ and $b_2 =
b_2(\varepsilon,c)$ such that for any $N$, $c > 0$, $\varepsilon > 0$, $p$ and any monotone symmetric family $A$ of $k$-CNF
formulas with $n$ variables such that $p \cdot \frac{d}{dq}\mathbb{P}_q(A)\big|_{q=p} \leq c$ and $\beta < \mathbb{P}_p(A) < 1-\beta$, there exists a formula
$G$ satisfying:

• $G$ is balanced

• $b_1 < E(G) < b_2$

• $|G| \leq B$

• $\mathbb{P}_p\{A \mid G\} > 1-\varepsilon$, where $\mathbb{P}_p\{A \mid G\}$ denotes the probability that a random formula belongs to $A$
conditioned on the appearance of a specific copy of $G$ in the random formula.

Note that conditioning on the appearance of a formula satisfying the above hypothesis is not the
same as conditioning on a specific copy of such a formula (with specified variables).
4.2 Proof of Theorem 2

In order to prove Theorem 2, let us assume that the property $\{Z(F(n,\alpha)) < 2^{n\phi}\}$ has a coarse
threshold. Let us consider $p_s$ in the critical interval of this property such that $p_s \cdot \frac{d}{dp}\mathbb{P}_p(A)\big|_{p=p_s} \leq c$ and
define $\alpha_s$ by $p_s = \alpha_s n/N$. By compactness there exists a subsequence of sizes $n$ along which $P_n(\alpha_s,\phi)$ has a
limit $P_\infty \in [\varepsilon, 1-\varepsilon]$. For simplicity of notation, consider $P_\infty = 1/2$ (the general case is completely
analogous). We can hence assume w.l.o.g. that $\alpha_s = \alpha_s(n)$ is such that $P_n(\alpha_s,\phi) = 1/2$ (and $\alpha_s$ is
well defined since $P_n$ is continuous and strictly increasing in $\alpha$).

The proof consists in showing that there does not exist a specific "short" formula $G$ of the
kind described in Theorem 7, providing a contradiction with the coarse threshold assumption. We
proceed by showing that the third assumption on $G$, the bounded number of clauses, leads directly
to a contradiction with the fourth one. This is shown with the following lemma.

Lemma 6. Let $G$ be a satisfiable formula with a finite number of variables and clauses. For any
$\varepsilon > 0$, we have for $n$ large enough

$$\mathbb{P}\{Z(F(n,\alpha_s)) < 2^{n\phi} \mid G\} \leq 1/2 + \varepsilon,$$

where $\mathbb{P}\{\,\cdot \mid G\}$ is interpreted as in the statement of Theorem 7.

Before proceeding to the proof of Lemma 6 we need the following technical estimates for the
case $\phi > 0$.

Lemma 7. For any $n$, $\alpha$, $\phi$, $\psi$, with $\phi > \psi$,

(i) $\frac{\partial}{\partial\alpha}P_n(\alpha,\phi) \leq C_1\sqrt{n}$,

(ii) $P_n(\alpha,\phi) - P_n(\alpha,\psi) \leq C_2(\phi-\psi)\sqrt{n}\log(n) + o(1)$,

where $C_1$ and $C_2$ do not depend on $n$.

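Proof. (i) By definition

$$P_n(\alpha,\phi) = \sum_{0\leq m\leq N}\binom{N}{m}p^m(1-p)^{N-m}\,\mathbb{P}\{Z(F(n,\alpha)) < 2^{n\phi} \mid |F| = m\},$$

and $p = n\alpha/N$, thus

$$\frac{\partial}{\partial\alpha}\,p^m(1-p)^{N-m} = \frac{n}{N}\,p^{m-1}(1-p)^{N-m-1}(m-Np)$$

and

$$\frac{\partial}{\partial\alpha}P_n(\alpha,\phi) = \frac{n}{N}\sum_{0\leq m\leq N}\binom{N}{m}p^{m-1}(1-p)^{N-m-1}(m-Np)\,\mathbb{P}\{Z(F(n,\alpha)) < 2^{n\phi} \mid |F| = m\}$$

$$\leq \frac{n}{N}\,p^{-1}(1-p)^{-1}\,\mathbb{E}\big[(|F|-Np)\,;\,|F| \geq Np\big]$$

$$\leq \frac{n}{N}\,p^{-1}(1-p)^{-1}\big(\mathbb{E}(|F|-Np)^2\big)^{1/2}$$

$$= \frac{n}{N}\,p^{-1}(1-p)^{-1}\big(Np(1-p)\big)^{1/2}$$

$$\leq C_1\sqrt{n}.$$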
(ii) Note that because of Lemma 7 (i),

$$\alpha \mapsto \mathbb{P}\{Z(F(n,\alpha)) < 2^{n\phi}\}$$

is Lipschitz with constant $C_1\sqrt{n}$. Therefore, it is sufficient to show that for any $a \geq 0$

$$\mathbb{P}\{Z(F(n,\alpha)) < 2^{a+1}\} \leq \mathbb{P}\{Z(F(n,\alpha + A\log(n)/n)) < 2^{a}\} + c\log(n)/\sqrt{n}, \qquad (4.1)$$

for some constants $c$ and $A$ (possibly depending on $k$), since then

$$\mathbb{P}\{Z(F(n,\alpha)) < 2^{a+1}\} - \mathbb{P}\{Z(F(n,\alpha)) < 2^{a}\}$$

$$\leq \mathbb{P}\{Z(F(n,\alpha + A\log(n)/n)) < 2^{a}\} - \mathbb{P}\{Z(F(n,\alpha)) < 2^{a}\} + c\log(n)/\sqrt{n}$$

$$\leq C_2\log(n)/\sqrt{n},$$

for some constant $C_2$. We first verify (4.1) for the model where the formulas have exactly $\alpha n$ clauses
drawn uniformly at random, using in that case the notation $F_{\alpha n}$. The result for the Binomial model
then follows from standard monotonicity arguments. We will show that there exists $\theta \in (0,1)$ such
that for any $a \geq 0$ and any integer $l \geq 1$,

$$\mathbb{P}\{Z(F_{\alpha n}) < 2^{a+1}\} - \mathbb{P}\{Z(F_{\alpha n+l}) < 2^{a}\} \leq 2\theta^{l}. \qquad (4.2)$$

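Note that for a deterministic formula $D$, and $C$ a $k$-clause uniformly drawn in $C_k(n)$,

$$\mathbb{E}_C Z(D\wedge C) = (1-2^{-k})\,Z(D),$$

hence for $l$ uniformly drawn $k$-clauses

$$\mathbb{E}_{C_1,\dots,C_l}Z(D\wedge C_1\wedge\dots\wedge C_l) \leq (1-2^{-k})^{l}\,Z(D).$$

Therefore, denoting by $\mathbb{E}_l = \mathbb{E}_{C_1,\dots,C_l}$ the expectation with respect to $C_1,\dots,C_l$, we get

$$\mathbb{P}\{Z(F_{\alpha n}) < 2^{a+1}\} \leq \mathbb{P}\{\mathbb{E}_l Z(F_{\alpha n+l}) < 2^{a+1}(1-2^{-k})^{l}\},$$

and defining $\tau := 2(1-2^{-k})^{l}$, we have

$$\mathbb{P}\{Z(F_{\alpha n+l}) < 2^{a}\} \geq \mathbb{P}\{Z(F_{\alpha n+l}) < 2^{a} \mid \mathbb{E}_l Z(F_{\alpha n+l}) < \tau 2^{a}\}\,\mathbb{P}\{\mathbb{E}_l Z(F_{\alpha n+l}) < \tau 2^{a}\}$$

$$\geq (1-\tau)\,\mathbb{P}\{\mathbb{E}_l Z(F_{\alpha n+l}) < \tau 2^{a}\},$$

where the last inequality follows from Markov's inequality. Thus,

$$\mathbb{P}\{Z(F_{\alpha n}) < 2^{a+1}\} - \mathbb{P}\{Z(F_{\alpha n+l}) < 2^{a}\} \leq 2(1-2^{-k})^{l}.$$

By setting $l = A(k)\log(n)$, the previous upper bound is $2n^{A(k)\log(1-2^{-k})}$, hence by taking $A(k)$
appropriately we get (4.1). □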
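
The one-clause identity $\mathbb{E}_C Z(D\wedge C) = (1-2^{-k})Z(D)$ used in this proof can be verified exactly on small instances by enumerating all of $C_k(n)$. The sketch below does this with exact rational arithmetic; the clause encoding and the helper names are illustrative assumptions of ours, not notation from the paper.

```python
import itertools
from fractions import Fraction

def all_clauses(n, k):
    """C_k(n): every choice of k distinct variables and k signs, C(n,k) * 2^k clauses."""
    for variables in itertools.combinations(range(n), k):
        for values in itertools.product((False, True), repeat=k):
            # A literal (i, v) is satisfied by an assignment x when x[i] == v.
            yield tuple(zip(variables, values))

def satisfies(x, clause):
    """True if assignment x satisfies at least one literal of the clause."""
    return any(x[i] == v for i, v in clause)

def count_solutions(formula, n):
    """Z(F) by exhaustive enumeration of {False, True}^n."""
    return sum(all(satisfies(x, c) for c in formula)
               for x in itertools.product((False, True), repeat=n))

def mean_after_one_clause(formula, n, k):
    """Exact value of E_C Z(D and C) for C uniform in C_k(n)."""
    clauses = list(all_clauses(n, k))
    total = sum(count_solutions(formula + [c], n) for c in clauses)
    return Fraction(total, len(clauses))
```

The identity holds because each fixed assignment violates exactly one of the $2^k$ sign patterns on every set of $k$ variables, so a uniformly drawn clause removes it with probability $2^{-k}$.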

We are now in a position to prove Lemma 6.
Proof of Lemma 6. Let $G$ be a satisfiable formula with a bounded number of clauses and, say, $r$
variables. Let us assume without loss of generality that $G$ contains only variables $x_1,\dots,x_r$, and
that it is satisfied when all variables are set to true. We then have

$$\mathbb{P}\{Z(F(n,\alpha_s)) < 2^{n\phi} \mid G\} \leq \mathbb{P}\{Z(F(n,\alpha_s)|_{x_1=\dots=x_r=T}) < 2^{n\phi}\},$$

where $T$ refers to the true assignment. The random formula $F(n,\alpha_s)|_{x_1=\dots=x_r=T}$ now has a different
structure and probability distribution. Let us denote by $F_*$ an equivalently distributed random
formula, which contains clauses of size 1 to $k$ and has only $n-r$ variables. Since $|C_k(n)| = \binom{n}{k}2^k$,
the number of $l$-clauses appearing in $F_*$, denoted by $S_l$, satisfies

$$\mathbb{E}S_k = \Theta(n), \qquad (4.3)$$

$$\mathbb{E}S_{k-1} = O(1), \qquad (4.4)$$

and more generally

$$\mathbb{E}S_{k-i} = O(n^{1-i}), \quad \forall\, 1 \leq i \leq k-1. \qquad (4.5)$$

Hence, defining a random formula $F_{**}$, which contains only $k$-clauses and $(k-1)$-clauses, with the
$k$-clauses selected independently in $C_k(n-r)$ with probability $p = \alpha_s n/\big(\binom{n}{k}2^k\big)$, and exactly $d$ clauses of
size $(k-1)$ uniformly selected in $C_{k-1}(n-r)$, we get that for any $\tau > 0$ and $n$ large enough

$$\mathbb{P}\{Z(F(n,\alpha_s)) < 2^{n\phi} \mid G\} \leq \mathbb{P}\{Z(F_{**}) < 2^{n\phi}\} + \tau ,$$

provided $d$ is large enough. Since 1-clauses are more constraining than $(k-1)$-clauses, we can upper
bound our estimate by replacing each $(k-1)$-clause by the conjunction of its $k-1$ literals, viewed as 1-clauses. Moreover, the
clauses on the $n-r$ variables are drawn with probability $p(n,\alpha_s) = \alpha_s n/\big(\binom{n}{k}2^k\big)$, hence, by drawing
them from $p(n-r,\alpha_s)$ instead we get, for $D = d(k-1)$,

$$\mathbb{P}\{Z(F(n,\alpha_s)) < 2^{n\phi} \mid G\} \leq \mathbb{P}\Big\{Z\Big(F(n-r,\alpha_s)\wedge\bigwedge_{i=1}^{D}C_i^{(1)}\Big) < 2^{n\phi}\Big\} + \tau. \qquad (4.6)$$
We now prove a useful fact. 

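Remark 1. Let $f$ be a Boolean formula on $n$ variables, $C_i^{(l)}$ be $l$-clauses independently and uniformly
selected in $C_l(n)$ and $y \in \mathbb{R}$. Then for any unbounded increasing sequence $h(n)$ and for any $\varepsilon > 0$,
we can take $n$ large enough such that

$$\mathbb{P}\Big\{Z\Big(f\wedge\bigwedge_{i=1}^{D}C_i^{(1)}\Big) < y\Big\} \leq \mathbb{P}\Big\{Z\Big(f\wedge\bigwedge_{i=1}^{h(n)}C_i^{(k)}\Big) < k^{D}y\Big\} + \varepsilon.$$

Proof. In order to prove this fact, we check that for $n$ large enough,

$$\mathbb{P}\{Z(f\wedge C_1^{(1)}) < y\} \leq \mathbb{P}\Big\{Z\Big(f\wedge\bigwedge_{i=1}^{h(n)/D}C_i^{(k)}\Big) < ky\Big\} + \varepsilon/D. \qquad (4.7)$$

Note that for any $g \geq 1$

$$\mathbb{P}\Big\{Z\Big(f\wedge\bigwedge_{i=1}^{g}C_i^{(k)}\Big) < ky\Big\} \geq \mathbb{P}\Big\{\min_{1\leq i\leq g}Z(f\wedge C_i^{(k)}) < ky\Big\}. \qquad (4.8)$$

Moreover,

$$\mathbb{P}\{Z(f\wedge C^{(k)}) < ky\} = \mathbb{P}\Big\{Z\Big(f\wedge\bigvee_{i=1}^{k}C_i^{(1)}\Big) < ky\Big\}$$

$$\geq \mathbb{P}\Big\{\sum_{i=1}^{k}Z(f\wedge C_i^{(1)}) < ky\Big\}$$

$$\geq \mathbb{P}\{Z(f\wedge C_i^{(1)}) < y,\ \forall\, 1 \leq i \leq k\}$$

$$= \mathbb{P}\{Z(f\wedge C_1^{(1)}) < y\}^{k}. \qquad (4.9)$$

Therefore, putting (4.8) and (4.9) together, we get

$$\mathbb{P}\Big\{Z\Big(f\wedge\bigwedge_{i=1}^{g}C_i^{(k)}\Big) < ky\Big\} \geq 1 - \big(1 - \mathbb{P}\{Z(f\wedge C_1^{(1)}) < y\}^{k}\big)^{g},$$

but this implies that we can take $g$ large enough, such that

$$\mathbb{P}\{Z(f\wedge C_1^{(1)}) < y\} \leq \mathbb{P}\Big\{Z\Big(f\wedge\bigwedge_{i=1}^{g}C_i^{(k)}\Big) < ky\Big\} + \varepsilon/D,$$

which proves the remark. □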
We can now use our remark to upper bound the estimate in Eq. (4.6):

$$\mathbb{P}\{Z(F(n,\alpha_s)) < 2^{n\phi} \mid G\} \leq \mathbb{P}\Big\{Z\Big(F(n-r,\alpha_s)\wedge\bigwedge_{i=1}^{h(n)}C_i^{(k)}\Big) < k^{D}2^{n\phi}\Big\} + 2\tau \qquad (4.10)$$

and by taking $t$ large enough

$$\mathbb{P}\Big\{Z\Big(F(n-r,\alpha_s)\wedge\bigwedge_{i=1}^{h(n)}C_i^{(k)}\Big) < k^{D}2^{n\phi}\Big\} \leq \mathbb{P}\{Z(F(n-r,\alpha_s + t\,h(n)/(n-r))) < k^{D}2^{n\phi}\} + \tau. \qquad (4.11)$$

Defining $H(n) := t\,h(n)/(n-r)$, we get from Lemma 7 (ii)

$$\mathbb{P}\{Z(F(n-r,\alpha_s + H(n))) < k^{D}2^{r\phi}2^{(n-r)\phi}\} \leq \mathbb{P}\{Z(F(n-r,\alpha_s + H(n))) < 2^{(n-r)\phi}\} + \tau. \qquad (4.12)$$

Note that by Lemma 7 (ii), the inequality in (4.12) holds for a clause density which is independent
of $n$, although here the clause density is $\alpha_s(n) + H(n)$. Since we will pick $H(n)$ to be $o(1/\sqrt{n})$,
the variation of $H(n)$ can be neglected. Regarding the variation of $\alpha_s(n)$, note that this sequence
fluctuates on a compact interval (for a fixed $\phi < 1$, on an interval contained in $(0,A]$ where $A$ is
an upper bound on the critical threshold of Friedgut [Fri99]). Then one can check that the gap in
this inequality, i.e., $P_n(\alpha,\phi) - P_n(\alpha,\phi + 1/n)$, tends to zero uniformly in $\alpha$, leading to the claimed
inequality.

Putting (4.10), (4.11) and (4.12) together, we get

$$\mathbb{P}\{Z(F(n,\alpha_s)) < 2^{n\phi} \mid G\} \leq \mathbb{P}\{Z(F(n-r,\alpha_s + H(n))) < 2^{(n-r)\phi}\} + 4\tau$$

$$= P_{n-r}(\alpha_s + H(n),\phi) + 4\tau$$

$$\leq P_n(\alpha_s + 2H(n),\phi) + 5\tau. \qquad (4.13)$$

To see that the inequality (4.13) holds, let us check that for $\alpha$, $\phi$ fixed, $g(n)$ increasing and $n$ large
enough

$$P_{n-1}(\alpha,\phi) \leq P_n(\alpha + g(n)/n,\phi) + \tau. \qquad (4.14)$$
The inequality (4.13) can then be verified by an appropriate choice of $g(n)$ and by using a similar
argument as discussed previously regarding the dependence on $n$ of the clause density. Recall that
a random formula in $F(n-1,\alpha)$ is drawn by picking each clause in $C_k(n-1)$ with probability
$p(n-1,\alpha) = \alpha(n-1)/\big(\binom{n-1}{k}2^k\big)$. By a coupling argument, since $p(n-1,\alpha) > p(n,\alpha)$, one can equivalently
draw a first formula $F_1$ by picking each clause in $C_k(n-1)$ with probability $p(n,\alpha)$ and a second
formula $F_2$ by picking each clause in $C_k(n-1)$ with probability $p(n-1,\alpha) - p(n,\alpha)$, creating the
formula $F_1\wedge F_2$. Note that a random formula $F(n,\alpha)$ picks each clause in $C_k(n-1)$ and also in
$C_k(n)\setminus C_k(n-1)$ with probability $p(n,\alpha)$. Hence

$$P_{n-1}(\alpha,\phi) = \mathbb{P}\{Z(F(n-1,\alpha)) < 2^{(n-1)\phi}\}$$

$$= \mathbb{P}\{Z(F_1\wedge F_2) < 2^{(n-1)\phi}\}$$

$$\leq \mathbb{P}\{Z(F(n,\alpha)\wedge F_2) < 2^{(n-1)\phi}\}. \qquad (4.15)$$

The expected number of clauses in $F_2$ is given by $\mathbb{E}|F_2| = |C_k(n-1)|\,(p(n-1,\alpha) - p(n,\alpha)) = O(1)$,
hence, we can upper bound (4.15) by replacing $F_2$ with a constant number of random 1-clauses and
use Remark 1 (as done above) to conclude that

$$P_{n-1}(\alpha,\phi) \leq \mathbb{P}\{Z(F(n,\alpha + g(n)/n)) < K2^{(n-1)\phi}\} \qquad (4.16)$$

for a constant $K$ and an increasing function $g(n)$. (Note that the 1-clauses are drawn within the set
of $n-1$ variables instead of $n$ variables, but this does not change the conclusion.) Finally, using
Lemma 7 (ii), we get

$$P_{n-1}(\alpha,\phi) \leq \mathbb{P}\{Z(F(n,\alpha + g(n)/n)) < 2^{n\phi}\} + \tau = P_n(\alpha + g(n)/n,\phi) + \tau, \qquad (4.17)$$

which proves (4.14). Hence, we have

$$\mathbb{P}\{Z(F(n,\alpha_s)) < 2^{n\phi} \mid G\} \leq P_n(\alpha_s + 2H(n),\phi) + 5\tau$$

and by choosing $h(n) = o(\sqrt{n})$ (increasing), i.e., $H(n) = o(1/\sqrt{n})$, and using Lemma 7 (i), we get

$$\mathbb{P}\{Z(F(n,\alpha_s)) < 2^{n\phi} \mid G\} \leq P_n(\alpha_s,\phi) + 6\tau.$$

□

4.3 Proofs of Theorem 3 and Lemma 1

Proof of Lemma 1. We refer to the proof of Lemma 5, which is more general. □

In order to prove Theorem 3, we first need the following technical lemma.

Lemma 8. Let $A(n) = O(n/(\log n)^{1+\varepsilon})$ for some $\varepsilon > 0$, and $t(n) = o(n)$. Let $f(\cdot)$ be positive, such
that $f(n)/n$ is bounded above and

$$f(n_1+n_2) + A(n_1+n_2) \geq f(n_1) + f(n_2), \quad \forall\, n_1, n_2 \geq t(n_1+n_2).$$

Then $f(n)/n$ converges.
Remark 3. This lemma still holds if $\Delta(n)$ is such that $\sum_{n} \frac{\Delta(2^n)}{2^n} < \infty$.
Proof. Let $\varepsilon > 0$. Since $f(n)/n$ is bounded above, we have $S := \limsup_n f(n)/n < \infty$. Let $n_0$ be large enough such that $f(n)/n < S + \varepsilon$ for any $n > n_0$. Let $r$ be an integer (to be chosen at our convenience but to be kept fixed) and let $\gamma(n)$ be such that $\gamma(n) \ge t(n)$ and $\gamma(n) = o(n)$. Define

$$m(n) = \inf\{r2^k : r2^k \ge \gamma(n)\}$$

and note that $\gamma(n) \le m(n) \le 2\gamma(n)$ for every $r$. For any $n$, we then have

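$$n = \lfloor n/m(n) - 1 \rfloor\, m(n) + q(n),$$

where $q(n) \in [m(n), 2m(n)]$, with $\lfloor n/m(n) - 1 \rfloor > 0$ for $n$ large enough. Hence, using the property of $f$, we have

$$f(n) \ge \lfloor n/m(n) - 1 \rfloor\, f(m(n)) + f(q(n)) - \sum_{k=1}^{\lfloor n/m(n) - 1 \rfloor} \Delta(q(n) + k\,m(n)),$$

and since $f$ is positive,

$$\frac{f(n)}{n} \ge \frac{m(n)}{n}\,\lfloor n/m(n) - 1 \rfloor\,\frac{f(m(n))}{m(n)} - \frac{1}{n}\sum_{k=1}^{\lfloor n/m(n) - 1 \rfloor} \Delta(q(n) + k\,m(n)). \tag{4.18}$$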

Since $\gamma(n) = o(n)$, for $n$ large enough, we have

$$\frac{m(n)}{n}\,\lfloor n/m(n) - 1 \rfloor \ge 1 - \varepsilon.$$
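The block decomposition used above can be made concrete with a short numerical sketch. The choice $\gamma(n) = n/(\log n)^{1.5}$ and the value $r = 8$ are illustrative assumptions, and the function names are hypothetical. The code computes $m(n)$, the number of blocks $\lfloor n/m(n) - 1 \rfloor$ and the remainder $q(n)$, and prints the prefactor $(m(n)/n)\lfloor n/m(n) - 1 \rfloor$, which should approach 1 as $n$ grows.

```python
import math


def gamma(n):
    # ASSUMPTION: an illustrative sublinear choice, gamma(n) = n / (log n)^1.5.
    return n / math.log(n) ** 1.5


def block_size(n, r):
    # m(n) = inf{ r * 2^k : r * 2^k >= gamma(n) }, with k >= 0.
    m = r
    while m < gamma(n):
        m *= 2
    return m


def decompose(n, r):
    # n = blocks * m + q with blocks = floor(n/m - 1) and q in [m, 2m).
    m = block_size(n, r)
    blocks = n // m - 1
    q = n - blocks * m
    return m, blocks, q


if __name__ == "__main__":
    for n in (10**4, 10**6, 10**8):
        m, blocks, q = decompose(n, r=8)
        print(n, m, blocks, q, m * blocks / n)
```

Since $q(n) = m(n) + (n \bmod m(n))$, the remainder always lies in $[m(n), 2m(n))$, which is what allows the superadditivity hypothesis to be applied to every piece.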



We now show that for $n$ large enough, we also have $f(m(n))/m(n) \ge S - 2\varepsilon$. First, note that we can take $r$ large enough such that $r2^k/2 \ge \gamma(r2^k)$ for all $k \ge 0$, since $\gamma(n) = o(n)$. Hence, using the property of $f$ with $n_1 = n_2 = r2^{k-1}$, we have

$$f(r2^k) \ge 2 f(r2^{k-1}) - \Delta(r2^k)$$

and, dividing by $r2^k$ and iterating,

$$\frac{f(r2^k)}{r2^k} \ge \frac{f(r)}{r} - \sum_{j=1}^{k} \frac{1}{r2^j}\,\Delta(r2^j). \tag{4.19}$$

Since we can pick $r$ at our convenience, note that if $r$ is a power of 2,

$$\sum_{j=1}^{k} \frac{1}{r2^j}\,\Delta(r2^j) = \sum_{j=\log_2 r + 1}^{\log_2 r + k} \frac{1}{2^j}\,\Delta(2^j),$$

which is, when $r$ increases, tending to zero uniformly in $k$, provided that

$$\sum_{j=1}^{\infty} \frac{1}{2^j}\,\Delta(2^j) < \infty.$$

Since the previous condition follows from our hypothesis on $\Delta$, and since we can always take $r$ large enough to ensure that $f(r)/r \ge S - \varepsilon$, we can take $r$ large enough such that, from (4.19), we have for any $k$

$$\frac{f(r2^k)}{r2^k} \ge S - 2\varepsilon,$$

and since $m(n)$ is of the form $r2^k$, for any $n$

$$\frac{f(m(n))}{m(n)} \ge S - 2\varepsilon.$$

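Finally, we need to show that the last term in (4.18) is vanishing, i.e., that

$$\frac{1}{n}\sum_{k=1}^{\lfloor n/m(n) - 1 \rfloor} \Delta(q(n) + k\,m(n)) = o(1).$$

For this, we pick $\gamma(n)$ to be large enough. For example, if $\Delta(n) = O(n/(\log n)^{1+\varepsilon})$, we have

$$\frac{1}{n}\sum_{k=1}^{\lfloor n/m(n) - 1 \rfloor} \Delta(q(n) + k\,m(n)) \le \frac{1}{m(n)}\,\Delta(n), \tag{4.20}$$

and since $m(n) \ge \gamma(n)$, if $\gamma(n) = \Theta(n/(\log n)^{1+\nu})$ with $\nu < \varepsilon$, the right-hand side is $O((\log n)^{\nu - \varepsilon}) = o(1)$ and we conclude the proof. In general, we pick $\gamma(n)$ such that $\Delta(n)/\gamma(n) = o(1)$. □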
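The summability condition $\sum_j 2^{-j}\Delta(2^j) < \infty$ that drives the proof can be illustrated numerically. The sketch below assumes the specific form $\Delta(n) = n/(\log n)^{1+\varepsilon}$ with natural logarithms, for which $\Delta(2^j)/2^j = (j \ln 2)^{-(1+\varepsilon)}$; the function names are hypothetical. It compares the dyadic tail for $\varepsilon = 1$, which shrinks as $r$ grows, with the borderline case $\varepsilon = 0$, where the tail keeps growing with the number of terms.

```python
import math


def dyadic_term(j, eps):
    # Delta(2^j) / 2^j for the ASSUMED form Delta(n) = n / (log n)^(1 + eps).
    return 1.0 / (j * math.log(2)) ** (1 + eps)


def dyadic_tail(r_exp, k, eps):
    # sum_{j = log2(r) + 1}^{log2(r) + k} Delta(2^j) / 2^j, with r = 2^r_exp.
    return sum(dyadic_term(j, eps) for j in range(r_exp + 1, r_exp + k + 1))


if __name__ == "__main__":
    for r_exp in (10, 100, 1000):
        # eps = 1: the tail goes to zero as r grows, uniformly in k.
        print("eps=1", r_exp, dyadic_tail(r_exp, 5000, 1.0))
    for k in (10**2, 10**3, 10**4):
        # eps = 0: the tail grows like log(k), so the condition fails.
        print("eps=0", k, dyadic_tail(10, k, 0.0))
```

This is why the exponent $1+\varepsilon$ with $\varepsilon > 0$ in the hypothesis on $\Delta$ cannot be relaxed to $\varepsilon = 0$ within this argument.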

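Proof of Theorem 3. Let $F_n = F(n,\alpha)$. Note that

$$\mathbb{E}\log(1 + Z(F_n)) = \mathbb{E}[\log(1 + Z(F_n)),\, Z(F_n) \ge 1]$$

and

$$\mathbb{E}[\log(1 + Z(F_n)),\, Z(F_n) \ge 1] = \mathbb{E}[\log Z(F_n),\, Z(F_n) \ge 1] + \mathbb{E}[\log(1 + Z(F_n)^{-1}),\, Z(F_n) \ge 1].$$

Let $c > 0$. We have

$$\begin{aligned}
\mathbb{E}[\log(1 + Z(F_n)^{-1}),\, Z(F_n) \ge 1] &\le \mathbb{E}[Z(F_n)^{-1},\, Z(F_n) \ge 1] \\
&\le \mathbb{E}[Z(F_n)^{-1},\, Z(F_n) \ge 1,\, Z_{FV} \ge cn] + \mathbb{P}\{Z_{FV} < cn,\, Z(F_n) \ge 1\}
\end{aligned}$$

where $Z_{FV}$ is the number of free variables in $F_n$. Since each free variable doubles the number of solutions, $\mathbb{E}[Z(F_n)^{-1},\, Z(F_n) \ge 1,\, Z_{FV} \ge cn] \le 2^{-cn}$. Moreover, there exist $c, c_2 > 0$ such that

$$\mathbb{P}\{Z_{FV} < cn,\, Z(F_n) \ge 1\} = O(e^{-c_2 n}),$$

hence there exists $\xi > 0$ such that

$$\tau(n) := \mathbb{E}[\log(1 + Z(F_n)^{-1}),\, Z(F_n) \ge 1] = O(e^{-\xi n}). \tag{4.21}$$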

On the other hand, we have (denoting by $F_{n_1}$, $F_{n_2}$ two independent formulas and letting $n = n_1 + n_2$)

$$\mathbb{E}\log(1 + Z(F_{n_1})Z(F_{n_2})) = \mathbb{E}[\log(1 + Z(F_{n_1})Z(F_{n_2})),\, Z(F_{n_1})Z(F_{n_2}) \ge 1]$$

and

$$\mathbb{E}[\log(1 + Z(F_{n_1})Z(F_{n_2})),\, Z(F_{n_1})Z(F_{n_2}) \ge 1] \ge \mathbb{E}[\log(Z(F_{n_1})Z(F_{n_2})),\, Z(F_{n_1})Z(F_{n_2}) \ge 1].$$

Hence, using Lemma 1, we get the following inequality

$$\mathbb{E}[\log Z(F_n),\, Z(F_n) \ge 1] + \tau(n) \ge \mathbb{E}[\log(Z(F_{n_1})Z(F_{n_2})),\, Z(F_{n_1})Z(F_{n_2}) \ge 1]$$

or equivalently

$$g(n) + \tau(n) \ge g(n_1) + g(n_2) - g(n_1)e(n_2) - g(n_2)e(n_1), \qquad \forall\, n_1, n_2 \ge k \tag{4.22}$$

where

$$g(n) = \mathbb{E}[\log Z(F_n),\, Z(F_n) \ge 1], \qquad e(n) = \mathbb{P}\{Z(F_n) = 0\}.$$

Note that $0 \le g(n) \le n$. Therefore (4.22) implies

$$g(n) + \tau(n) \ge g(n_1) + g(n_2) - n_1 e(n_2) - n_2 e(n_1), \qquad \forall\, n_1, n_2 \ge k. \tag{4.23}$$
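The passage to (4.22) rests on the identity $\mathbb{E}[\log(Z_1 Z_2),\, Z_1 Z_2 \ge 1] = g_1(1 - e_2) + g_2(1 - e_1)$ for independent counts $Z_1, Z_2$, where $g_i = \mathbb{E}[\log Z_i,\, Z_i \ge 1]$ and $e_i = \mathbb{P}\{Z_i = 0\}$. The following sketch checks it on two small, made-up distributions of solution counts; the distributions and function names are purely illustrative, and base-2 logarithms are an assumption.

```python
import math


def g_and_e(dist):
    # dist maps a solution count z to its probability.
    # g = E[log2 Z, Z >= 1] and e = P{Z = 0}.
    g = sum(p * math.log2(z) for z, p in dist.items() if z >= 1)
    e = dist.get(0, 0.0)
    return g, e


def log_product_on_sat(d1, d2):
    # E[log2(Z1 * Z2), Z1 * Z2 >= 1] for independent Z1 ~ d1, Z2 ~ d2.
    return sum(
        p1 * p2 * math.log2(z1 * z2)
        for z1, p1 in d1.items()
        for z2, p2 in d2.items()
        if z1 * z2 >= 1
    )


if __name__ == "__main__":
    d1 = {0: 0.25, 1: 0.25, 4: 0.5}   # hypothetical law of Z(F_{n1})
    d2 = {0: 0.125, 2: 0.875}         # hypothetical law of Z(F_{n2})
    g1, e1 = g_and_e(d1)
    g2, e2 = g_and_e(d2)
    print(log_product_on_sat(d1, d2), g1 + g2 - g1 * e2 - g2 * e1)
```

The identity holds because the event $\{Z_1 Z_2 \ge 1\}$ factorises as $\{Z_1 \ge 1\} \cap \{Z_2 \ge 1\}$ and the two formulas are independent.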

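Since $\alpha < \alpha^*$, we have that $e(n) = O(1/(\log n)^{1+\varepsilon})$ for some $\varepsilon > 0$. We then restrict ourselves to

$$n_1, n_2 \ge t(n) := n/(\log n)^{\eta}, \qquad \text{with } \eta = \varepsilon/3.$$

This implies that $n^{1-\eta} \le n_2$ and $(1-\eta)\log n \le \log n_2$. So, for $n_1, n_2$ large enough, we have

$$n_1 \le n_2 (\log n_2)^{2\eta} \tag{4.24}$$

$$n_2 \le n_1 (\log n_1)^{2\eta}. \tag{4.25}$$

Going back to (4.23), we get

$$g(n) + \tau(n) \ge g(n_1) + g(n_2) - n_1(\log n_1)^{2\eta} e(n_1) - n_2(\log n_2)^{2\eta} e(n_2), \qquad \forall\, n_1, n_2 \ge n/(\log n)^{\eta}$$

or equivalently

$$f(n) + \Delta(n) \ge f(n_1) + f(n_2), \qquad \forall\, n_1, n_2 \ge t(n) \tag{4.26}$$

where

$$f(n) = g(n) - n(\log n)^{2\eta} e(n), \qquad \Delta(n) = n(\log n)^{2\eta} e(n) + \tau(n), \qquad t(n) = n/(\log n)^{\eta}.$$

But $\varepsilon - 2\eta = \varepsilon/3 > 0$, hence

$$\Delta(n) \le O\!\left(\frac{n}{(\log n)^{1+\varepsilon/3}}\right),$$

and we satisfy the hypothesis of Lemma 8, which implies that $f(n)/n$ converges, hence $g(n)/n$ converges too. □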
4.4 Proofs of Lemma 2 and Theorem 1

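Proof of Lemma 2. From Theorem 3, for every $\alpha < \alpha^*$, $\psi_n(\alpha)$ converges to a limit $\phi_s(\alpha)$. Note that $\phi_s(\cdot)$ is a non-increasing function on $[0,\alpha^*)$; hence, it has a countable number of plateaus and discontinuities. Let $\alpha_0 \in [0,\alpha^*)$ and denote $\phi_0 = \phi_s(\alpha_0)$. If $\alpha_n(\phi_0)$ does not converge, define

$$\underline{\alpha}_0 = \liminf_{n\to\infty} \alpha_n(\phi_0), \qquad n_k \nearrow \infty \ \text{ s.t. } \lim_{k\to\infty} \alpha_{n_k}(\phi_0) = \underline{\alpha}_0,$$

$$\overline{\alpha}_0 = \limsup_{n\to\infty} \alpha_n(\phi_0), \qquad m_k \nearrow \infty \ \text{ s.t. } \lim_{k\to\infty} \alpha_{m_k}(\phi_0) = \overline{\alpha}_0.$$

Then, for any $\alpha \in (\underline{\alpha}_0, \overline{\alpha}_0)$, there exists $\varepsilon > 0$ such that

$$P_{m_k}(\alpha,\phi_0) \le P_{m_k}(\alpha_{m_k}(\phi_0) - \varepsilon,\phi_0) \to 0 \tag{4.27}$$

and

$$P_{n_k}(\alpha,\phi_0) \ge P_{n_k}(\alpha_{n_k}(\phi_0) + \varepsilon,\phi_0) \to 1. \tag{4.28}$$

Moreover, if $\alpha < \alpha^*$,

$$P_{n_k}(\alpha,\phi_0) = \mathbb{P}\Big\{\frac{1}{n_k}\log Z(F(n_k,\alpha)) < \phi_0 \,\Big|\, Z(F(n_k,\alpha)) \ge 1\Big\} + o(1),$$

hence

$$\lim_{k\to\infty} \mathbb{E}\Big[\frac{1}{n_k}\log Z(F(n_k,\alpha)) \,\Big|\, Z(F(n_k,\alpha)) \ge 1\Big] \le \phi_0,$$

i.e., since $\psi_n(\alpha)$ converges to $\phi_s(\alpha)$ as recalled above,

$$\phi_s(\alpha) \le \phi_0.$$

Similarly, we have

$$\lim_{k\to\infty} \mathbb{E}\Big[\frac{1}{m_k}\log Z(F(m_k,\alpha)) \,\Big|\, Z(F(m_k,\alpha)) \ge 1\Big] \ge \phi_0,$$

and

$$\phi_s(\alpha) \ge \phi_0.$$

Therefore, $\phi_0$ is a plateau of $\phi_s(\cdot)$, and since $\phi_s(\cdot)$ has countably many plateaus, there are countably many $\phi_0 \in \phi_s([0,\alpha^*))$ for which $\alpha_n(\phi_0)$ does not converge. □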

Proof of Theorem 1. From Theorem 3, there exists a function $\phi_s(\cdot)$ such that for any $\alpha \in [0,\alpha^*)$, we have $\phi_s(\alpha) = \lim_{n\to\infty} \psi_n(\alpha)$, where $\psi_n(\cdot)$ is defined in Theorem 3. Let $I := \phi_s([0,\alpha^*))$. From Lemma 2, there exists a countable set $C \subset I$ and a function $A : I \setminus C \to [0,\alpha^*)$ such that for any $\phi \in I \setminus C$, we can define the limit $A(\phi) = \lim_{n\to\infty} \alpha_n(\phi)$. Note that for any $\phi \in I \setminus C$, Theorem 2 implies $\phi_s(A(\phi)) = \phi$.

Now, for any $\alpha \in [0,\alpha^*)$ which is not a discontinuity point of $\phi_s$ (this holds except on a countable subset of $[0,\alpha^*)$), and for any $\varepsilon > 0$, there exists $\varepsilon' < \varepsilon$ such that $\phi^* := \phi_s(\alpha) - \varepsilon' \in I \setminus C$ and hence $\alpha_n(\phi^*)$ tends to a limit $A_*$. Note that $A_* > \alpha$, since $\alpha$ is not a discontinuity point of $\phi_s$ and since $\phi_s(A_*) = \phi^*$. Therefore, there exists $\delta > 0$ such that

$$P_n(\alpha, \phi_s(\alpha) - \varepsilon) \le P_n(\alpha, \phi^*) \le P_n(\alpha_n(\phi^*) - \delta, \phi^*)$$

and we conclude from Theorem 2 that $P_n(\alpha, \phi_s(\alpha) - \varepsilon) \to 0$ when $n \to \infty$. With a similar argument, we conclude that $P_n(\alpha, \phi_s(\alpha) + \varepsilon) \to 1$ when $n \to \infty$. □
4.5 Proofs of Lemma 3, Lemma 4 and Lemma 5



Proof of Lemma 3. In this proof, we keep $\alpha$ fixed and split the $n$ variables into two sets of $n_1$ and $n_2 = n - n_1$ variables, such as $\{1,\dots,n_1\}$ and $\{n_1+1,\dots,n\}$. For convenience, we now work with the interpolated Poisson model. We construct a random Boolean formula as follows: we first draw independently the integers $M$, $M_1$ and $M_2$ under Poisson distributions of parameters $\alpha n t$, $\alpha n_1 (1-t)$ and $\alpha n_2 (1-t)$ respectively. We then draw independently $M$ clauses from the full system, by picking for each clause the indices of the variables appearing in it independently and uniformly at random within the set of $n$ variables and by picking $\varphi$ under $\mu$. We also draw independently $M_i$ clauses from each sub-system, by picking for each clause the indices of the variables appearing in it independently and uniformly at random within the set of $n_i$ variables and by picking $\varphi_i$ under $\mu$. Finally, we take the conjunction of all clauses to create the formula $F_n(t)$.
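The construction of $F_n(t)$ can be sketched as a small sampler. This is only an illustration under stated assumptions: the clause type $\varphi$ is taken to be a $k$-SAT clause with i.i.d. uniform signs (the section only says that $\varphi$ is drawn under $\mu$), Poisson variables are drawn with Knuth's multiplication method (adequate for moderate means only), and all function names are hypothetical.

```python
import math
import random


def poisson(lam, rng):
    # Knuth's multiplication method; suitable for moderate lam only.
    threshold = math.exp(-lam)
    count, prod = 0, rng.random()
    while prod > threshold:
        count += 1
        prod *= rng.random()
    return count


def draw_clause(variables, k, rng):
    # k indices drawn i.i.d. uniformly (repetitions allowed), plus one sign each.
    # ASSUMPTION: uniform signs stand in for "phi drawn under mu".
    return tuple((rng.choice(variables), rng.choice((-1, 1))) for _ in range(k))


def sample_interpolated(n, n1, alpha, k, t, rng):
    # M ~ Poisson(alpha*n*t) clauses on all variables, and
    # M_i ~ Poisson(alpha*n_i*(1-t)) clauses inside each block.
    full = list(range(n))
    left, right = full[:n1], full[n1:]
    plan = (
        (alpha * n * t, full),
        (alpha * n1 * (1 - t), left),
        (alpha * (n - n1) * (1 - t), right),
    )
    clauses = []
    for lam, block in plan:
        clauses += [draw_clause(block, k, rng) for _ in range(poisson(lam, rng))]
    return clauses


if __name__ == "__main__":
    rng = random.Random(0)
    print(len(sample_interpolated(20, 8, 2.0, 3, 0.5, rng)))
```

At $t = 1$ only the full system contributes, while at $t = 0$ every clause lives inside one of the two blocks, so the formula factorises into two independent sub-formulas.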
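Note that the claim of the lemma is equivalent to

$$\mathbb{E}\log(1 + Z(F_n(1))) \ge \mathbb{E}\log(1 + Z(F_n(0))), \tag{4.29}$$

which is proved by showing that

$$\frac{d}{dt}\mathbb{E}\log(1 + Z(F_n(t))) \ge 0.$$

An elementary calculation yields

$$\begin{aligned}
\frac{1}{n}\frac{d}{dt}\mathbb{E}\log(1 + Z(F_n(t))) ={}& \alpha\big[\mathbb{E}_{\varphi,I}\mathbb{E}\log(1 + Z(F_n(t) \wedge \varphi(x_I))) - \mathbb{E}\log(1 + Z(F_n(t)))\big] \\
&- \alpha\frac{n_1}{n}\big[\mathbb{E}_{\varphi_1,I_1}\mathbb{E}\log(1 + Z(F_n(t) \wedge \varphi_1(x_{I_1}))) - \mathbb{E}\log(1 + Z(F_n(t)))\big] \\
&- \alpha\frac{n_2}{n}\big[\mathbb{E}_{\varphi_2,I_2}\mathbb{E}\log(1 + Z(F_n(t) \wedge \varphi_2(x_{I_2}))) - \mathbb{E}\log(1 + Z(F_n(t)))\big],
\end{aligned}$$

where $\varphi, \varphi_1, \varphi_2 \sim \mu$, $I \sim U^k$, $I_1 \sim U_1^k$, $I_2 \sim U_2^k$, all independent, and where $U^k$, respectively $U_i^k$, denotes the $k$-th product measure of $U$, respectively $U_i$ (where $U$, resp. $U_i$, denotes the uniform measure on the $n$ variables, resp. $n_i$ variables). Hence, $x_I = (x_{i_1},\dots,x_{i_k})$ with $i_1,\dots,i_k$ i.i.d. uniform over the $n$ variables.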
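We then have

$$\mathbb{E}_{\varphi,I}\mathbb{E}\log(1 + Z(F_n(t) \wedge \varphi(x_I))) - \mathbb{E}\log(1 + Z(F_n(t))) = \mathbb{E}_{\varphi,I}\mathbb{E}\log\langle \varphi(X_I) \rangle$$

where $X$ is uniformly drawn within the augmented solution space $S(F_n^*(t)) = S(F_n(t)) \cup \{*\}$, where $*$ is an assignment which returns true on any Boolean function, and $\langle \cdot \rangle$ denotes the expectation with respect to $X$. Note that

$$\mathbb{E}_{\varphi,I}\mathbb{E}\log\langle \varphi(X_I) \rangle = -\mathbb{E}_{\varphi,I}\mathbb{E}\sum_{l=1}^{\infty} \frac{1}{l}\langle \bar{\varphi}(X_I) \rangle^{l} \tag{4.30}$$

where $\bar{\varphi} = 1 - \varphi$. We now introduce the 'replicas' $X^{(r)}$, which are independent and identically distributed copies of $X$. We then have

$$\langle \bar{\varphi}(X_I) \rangle^{l} = \Big\langle \prod_{r=1}^{l} \bar{\varphi}(X_I^{(r)}) \Big\rangle, \qquad \forall\, l \ge 1.$$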
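Both identities above, the series expansion of $\log\langle\varphi\rangle$ and the replica representation of $\langle\bar\varphi\rangle^l$, can be verified by brute force on a toy example. In the sketch below the augmented solution space and the clause are made up for illustration, and the helper names are hypothetical; averages over replicas are computed exactly with rational arithmetic.

```python
import itertools
import math
from fractions import Fraction

# Hypothetical augmented solution space: a few assignments plus the star "*".
SOLUTIONS = [(1, 1, -1), (1, -1, -1), (-1, 1, 1), (-1, -1, 1), "*"]


def phi_bar(x, idx=(0, 1)):
    # 1 - phi for a hypothetical clause "x_0 or x_1"; "*" satisfies every clause.
    if x == "*":
        return 0
    return 1 if all(x[i] == -1 for i in idx) else 0


def bracket(fn, l=1):
    # Exact average of fn over l independent uniform replicas.
    total = sum(fn(*xs) for xs in itertools.product(SOLUTIONS, repeat=l))
    return Fraction(total, len(SOLUTIONS) ** l)


def replica_moment(l):
    # < prod_{r=1}^{l} phi_bar(X^{(r)}) >
    return bracket(lambda *xs: math.prod(phi_bar(x) for x in xs), l)


if __name__ == "__main__":
    a = bracket(phi_bar)
    print(a, [replica_moment(l) for l in (1, 2, 3)])
    series = -sum(float(a) ** l / l for l in range(1, 200))
    print(math.log(float(1 - a)), series)
```

The replica identity is simply independence of the copies, which is what turns a power of an average into an average of a product.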
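We are done if we can show that for any realizations of the $X^{(r)}$'s and for any $l \ge 1$,

$$\mathbb{E}_{\varphi,I}\prod_{r=1}^{l}\bar{\varphi}(X_I^{(r)}) - \frac{n_1}{n}\mathbb{E}_{\varphi_1,I_1}\prod_{r=1}^{l}\bar{\varphi}_1(X_{I_1}^{(r)}) - \frac{n_2}{n}\mathbb{E}_{\varphi_2,I_2}\prod_{r=1}^{l}\bar{\varphi}_2(X_{I_2}^{(r)}) \le 0, \tag{4.31}$$

since, by (4.30), the derivative is then a sum of nonnegative terms. Note that

$$\mathbb{E}_{\varphi,I}\prod_{r=1}^{l}\bar{\varphi}(X_I^{(r)}) = \mathbb{E}_{\varphi}\mathbb{E}_{P}\prod_{r=1}^{l}\bar{\varphi}(\xi^{(r)}) \tag{4.32}$$

where $\xi^{(1)},\dots,\xi^{(l)} \sim P$ and where $P$ is the empirical distribution of $X^{(1)},\dots,X^{(l)}$, i.e., the distribution on $\{-1,1\}^{kl}$ given by

$$P(\xi^{(1)},\dots,\xi^{(l)}) = \prod_{i=1}^{k} P(\xi_i^{(1)},\dots,\xi_i^{(l)})$$

with

$$P(x^{(1)},\dots,x^{(l)}) = \frac{\#\{i \in \{1,\dots,n\} : (X_i^{(1)},\dots,X_i^{(l)}) = (x^{(1)},\dots,x^{(l)})\}}{n}$$

and similarly

$$\mathbb{E}_{\varphi_s,I_s}\prod_{r=1}^{l}\bar{\varphi}_s(X_{I_s}^{(r)}) = \mathbb{E}_{\varphi}\mathbb{E}_{P_s}\prod_{r=1}^{l}\bar{\varphi}(\xi_s^{(r)}), \qquad s = 1,2 \tag{4.33}$$

where $\xi_s^{(1)},\dots,\xi_s^{(l)} \sim P_s$ and where $P_s$ is the empirical distribution of $X^{(1)},\dots,X^{(l)}$ restricted to the $s$-th set of variables, i.e., the distribution on $\{-1,1\}^{kl}$ given by

$$P_s(\xi^{(1)},\dots,\xi^{(l)}) = \prod_{i=1}^{k} P_s(\xi_i^{(1)},\dots,\xi_i^{(l)}), \qquad s = 1,2$$

with

$$P_1(x^{(1)},\dots,x^{(l)}) = \frac{\#\{i \in \{1,\dots,n_1\} : (X_i^{(1)},\dots,X_i^{(l)}) = (x^{(1)},\dots,x^{(l)})\}}{n_1},$$

$$P_2(x^{(1)},\dots,x^{(l)}) = \frac{\#\{i \in \{n_1+1,\dots,n\} : (X_i^{(1)},\dots,X_i^{(l)}) = (x^{(1)},\dots,x^{(l)})\}}{n_2}.$$

Now, using the operator $\Gamma_l$ defined in H2, i.e.,

$$\Gamma_l : \nu \in M_1(\{-1,1\}^l) \mapsto \mathbb{E}_{\varphi}\mathbb{E}_{\nu}\prod_{r=1}^{l}(1 - \varphi(Z^{(r)}))$$

where $Z^{(r)}$ are Boolean random vectors of dimension $k$ such that $Z_i = (Z_i^{(1)},\dots,Z_i^{(l)})$, $i = 1,\dots,k$, are i.i.d. with distribution $\nu$, note that (4.31) is equivalent to

$$\Gamma_l(P) - \frac{n_1}{n}\Gamma_l(P_1) - \frac{n_2}{n}\Gamma_l(P_2) \le 0,$$

which holds by convexity of $\Gamma_l$, since

$$P = \frac{n_1}{n}P_1 + \frac{n_2}{n}P_2.$$

□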
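The last step of the proof uses that the single-site empirical measure over all $n$ variables is the mixture $\frac{n_1}{n}P_1 + \frac{n_2}{n}P_2$ of the empirical measures over the two blocks. The sketch below checks this counting identity exactly on a made-up pair of replicas; the data and the helper name are hypothetical.

```python
from collections import Counter
from fractions import Fraction


def empirical(replicas, sites):
    # Empirical law of the column (X_i^{(1)}, ..., X_i^{(l)}) over i in sites.
    sites = list(sites)
    counts = Counter(tuple(x[i] for x in replicas) for i in sites)
    return {pattern: Fraction(c, len(sites)) for pattern, c in counts.items()}


if __name__ == "__main__":
    # Two hypothetical replicas on n = 6 variables, split as n1 = 2, n2 = 4.
    replicas = [[1, -1, 1, 1, -1, -1], [1, 1, -1, 1, -1, 1]]
    n, n1 = 6, 2
    P = empirical(replicas, range(n))
    P1 = empirical(replicas, range(n1))
    P2 = empirical(replicas, range(n1, n))
    mix = {
        pat: Fraction(n1, n) * P1.get(pat, 0) + Fraction(n - n1, n) * P2.get(pat, 0)
        for pat in set(P1) | set(P2)
    }
    print(mix == P)
```

Once this mixture identity is available, the inequality for $\Gamma_l$ is an instance of Jensen's inequality for a convex function of the measure.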
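Proof of Lemma 4. We have

$$\begin{aligned}
\Gamma_l(\nu) &= \mathbb{E}_{\varphi}\sum_{z^{(1)},\dots,z^{(l)} \in \{-1,1\}^k}\ \prod_{r=1}^{l}(1 - \varphi(z^{(r)}))\ \prod_{i=1}^{k}\nu\big((z^{(1)})_i,\dots,(z^{(l)})_i\big) \\
&= \mathbb{E}_{s^{(\cdot)}}\sum_{z^{(1)},\dots,z^{(l)} \in \{s^{(1)},\dots,s^{(J)}\}}\ \prod_{i=1}^{k}\nu\big((z^{(1)})_i,\dots,(z^{(l)})_i\big),
\end{aligned}$$

where the last sum can equivalently be indexed by $j_1,\dots,j_l \in \{1,\dots,J\}$ with $z^{(r)} = s^{(j_r)}$, and $\Gamma_l$ is convex for any $l \ge 1$. □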

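Proof of Lemma 5. We need to check the convexity of

$$\nu \mapsto \mathbb{E}_s\mathbb{E}_{\nu}\prod_{r=1}^{l}(1 - \varphi_s(Z^{(r)})) \tag{4.34}$$

where $Z^{(r)}$ are Boolean random vectors of dimension $k$ such that $Z_i = (Z_i^{(1)},\dots,Z_i^{(l)})$, $i = 1,\dots,k$, are i.i.d. with distribution $\nu$,

$$\varphi_s(x) = \mathbb{1}\Big(\prod_{i=1}^{k} x_i = s\Big)$$

and

$$\mu(\varphi_1) = \mu(\varphi_{-1}) = 1/2.$$

Note that

$$\mathbb{E}_{\nu}\prod_{r=1}^{l}(1 - \varphi_s(Z^{(r)})) = \mathbb{P}\Big\{\prod_{i=1}^{k} Z_i = -s^{l}\Big\} \tag{4.35}$$

where $-s^{l}$ denotes the vector $(-s,\dots,-s)$ with $l$ components and where $\prod_{i=1}^{k} Z_i$ denotes the component-wise product of the vectors $Z_i$. Since the $Z_i$ are i.i.d. under $\nu$ and valued in $\{-1,1\}^l$, and since we are interested in their product, we now work with their Fourier transform. For any $Q \subseteq \{1,\dots,l\}$, let

$$f(Q) = f_{Z_i}(Q) = \mathbb{E}\prod_{r \in Q} Z_i^{(r)}.$$

Note that we can recover the distribution of $Z_i$ by knowing $f(Q)$ for any $Q$, in particular

$$\mathbb{P}\{Z_i = 1^{l}\} = \frac{1}{2^l}\sum_{Q} f(Q)$$

and

$$\mathbb{P}\{Z_i = -1^{l}\} = \frac{1}{2^l}\sum_{Q} (-1)^{|Q|} f(Q).$$

Moreover,

$$f_{\prod_{i=1}^{k} Z_i}(Q) = f(Q)^{k},$$

hence,

$$\mathbb{E}_s\,\mathbb{P}\Big\{\prod_{i=1}^{k} Z_i = -s^{l}\Big\} = \frac{1}{2^l}\sum_{Q:\ |Q| \text{ even}} f(Q)^{k}.$$

Since $f(Q)$ is linear in $\nu$ (it is the expectation of $\prod_{r \in Q} Z_i^{(r)}$ where $(Z_i^{(1)},\dots,Z_i^{(l)}) \sim \nu$), the above summation is clearly convex in $\nu$ if $k$ is even. □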
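The Fourier argument can be cross-checked numerically. The sketch below, with hypothetical function names and randomly generated measures $\nu$, compares a brute-force evaluation of $\tfrac12\mathbb{P}\{\prod_i Z_i = 1^l\} + \tfrac12\mathbb{P}\{\prod_i Z_i = -1^l\}$ with the expression $2^{-l}\sum_{|Q| \text{ even}} f(Q)^k$, and tests midpoint convexity for an even $k$. It is an illustration of the identity, not a substitute for the proof.

```python
import itertools
import math
import random


def random_nu(l, rng):
    # A random probability measure on {-1, 1}^l (illustrative input only).
    points = list(itertools.product((-1, 1), repeat=l))
    weights = [rng.random() for _ in points]
    total = sum(weights)
    return {z: w / total for z, w in zip(points, weights)}


def fourier(nu, l):
    # f(Q) = E prod_{r in Q} Z^{(r)} for Z ~ nu, for every subset Q.
    return {
        Q: sum(p * math.prod(z[r] for r in Q) for z, p in nu.items())
        for size in range(l + 1)
        for Q in itertools.combinations(range(l), size)
    }


def gamma_fourier(nu, l, k):
    # 2^{-l} * sum over subsets Q of even size of f(Q)^k.
    return sum(c**k for Q, c in fourier(nu, l).items() if len(Q) % 2 == 0) / 2**l


def gamma_brute(nu, l, k):
    # (1/2) P{prod_i Z_i = 1^l} + (1/2) P{prod_i Z_i = -1^l}, Z_i i.i.d. ~ nu.
    targets = {tuple([1] * l), tuple([-1] * l)}
    total = 0.0
    for zs in itertools.product(nu, repeat=k):
        prod = tuple(math.prod(z[r] for z in zs) for r in range(l))
        if prod in targets:
            total += 0.5 * math.prod(nu[z] for z in zs)
    return total


def mix(nu1, nu2, w):
    # Convex combination w * nu1 + (1 - w) * nu2.
    return {z: w * nu1[z] + (1 - w) * nu2[z] for z in nu1}


if __name__ == "__main__":
    rng = random.Random(1)
    nu1, nu2 = random_nu(3, rng), random_nu(3, rng)
    print(gamma_brute(nu1, 3, 4), gamma_fourier(nu1, 3, 4))
    mid = gamma_fourier(mix(nu1, nu2, 0.5), 3, 4)
    print(mid, 0.5 * gamma_fourier(nu1, 3, 4) + 0.5 * gamma_fourier(nu2, 3, 4))
```

For even $k$ each term $f(Q)^k$ is an even power of a linear functional of $\nu$, hence convex; for odd $k$ this need not hold, which is why the lemma restricts to even $k$.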


